Improved unsupervised NAP training dataset design for speaker recognition

نویسندگان

  • Hanwu Sun
  • Bin Ma
چکیده

The Nuisance Attribute Project (NAP) with labeled data provides an effective approach for improving the speaker recognition performance in the state-of-art speaker recognition system by removing unwanted channel and handset variation. However, the requirement for the labeled NAP training data may limit its practical application. In our previous study, a simple unsupervised clustering algorithm based on dot products between supervectors was introduced for designing NAP training dataset without a prior knowledge about channel and speaker information. Using such clustering results as the initial training dataset, in this paper, we make a further improvement of the training dataset by enhancing similarity measurement of supervectors via NAP projection and score normalization. The effectiveness of this unsupervised NAP training dataset design strategy has been verified in the experiments using the in-house development dataset of IIR submission for the 2012 NIST SRE.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Unsupervised NAP Training Data Design for Speaker Recognition

The Nuisance Attribute Projection (NAP) with labeled data provides an effective approach for improving the speaker recognition performance in the state-of-art speaker recognition system by removing unwanted speaker channel and handsets variation. However, the requirement for the labeled NAP training data may limit its practical application. In this paper, we propose an unsupervised clustering s...

متن کامل

NAP, WCCN, a New Linear Kernel, and Keyword Weighting for the HMM Supervector Speaker Recognition System

We demonstrate the application of Nuisance Attribute Projection (NAP), Within-Class Covariance Normalization (WCCN), a new standard kernel, and keyword weighting for the keywordbased HMM supervector speaker recognition system. On our development set (SRE04 8-side training), we achieve 22.6% and 16.2% EER improvements using NAP and WCCN respectively, a 19.5% EER improvement using NAP and WCCN jo...

متن کامل

Discriminant NAP for SVM speaker recognition

Nuisance Attribute Projection (NAP) provides an effective method of removing the unwanted session variability in a Support Vector Machine (SVM) based speaker recognition system by removing the principal components of this variability. There is no guarantee with the methods proposed, however, that desired speaker variability is retained. This paper investigates the possibility of training NAP di...

متن کامل

ALIZE/spkdet: a state-of-the-art open source software for speaker recognition

This paper presents the ALIZE/SpkDet open source software packages for text independent speaker recognition. This software is based on the well-known UBM/GMM approach. It includes also the latest speaker recognition developments such as Latent Factor Analysis (LFA) and unsupervised adaptation. Discriminant classifiers such as SVM supervectors are also provided, linked with the Nuisance Attribut...

متن کامل

Weighted Nuisance Attribute Projection

Nuisance attribute projection (NAP) has become a common method for compensation of channel effects, session variation, speaker variation, and general mismatch in speaker recognition. NAP uses an orthogonal projection to remove a nuisance subspace from a larger expansion space that contains the speaker information. Training the NAP subspace is based on optimizing pairwise distances to reduce int...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013